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ABSTRACT 

Single-stranded DNA overhangs at the ends of 
human telomeric repeats are capable of adopting 
four-stranded G-quadruplex structures, which 
could serve as potential anticancer targets. Out of 
the five reported intramolecular human telomeric 
G-quadruplex structures, four were formed in the 
presence of K + ions and only one in the presence 
of Na + ions, leading often to a perception that this 
structural polymorphism occurs exclusively in the 
presence of K + but not Na + . Here we present the 
structure of a new antiparallel (2+2) G-quadruplex 
formed by a derivative of a 27-nt human telomeric 
sequence in Na + solution, which comprises a novel 
core arrangement distinct from the known 
topologies. This structure complements the previ- 
ously elucidated basket-type human telomeric G- 
quadruplex to serve as reference structures in 
Na-containing environment. These structures, 
together with the coexistence of other conform- 
ations in Na + solution as observed by nuclear 
magnetic resonance spectroscopy, establish the 
polymorphic nature of human telomeric repeats 
beyond the influence of K + ions. 

INTRODUCTION 

Telomeres, the ends of linear eukaryotic chromosomes, 
perform vital roles in the maintenance of genome integrity 
and the regulation of cell proliferation (1). Human telo- 
meric DNA consists of thousands of tandem repeats of 
(TTAGGG) n sequences (2,3), terminating as single- 
stranded DNA of 100-200 nt at the 3'-ends (4). Such 



guanine-rich repeats are capable of adopting four- 
stranded G-quadruplex structures (5-8) under 
physiological conditions. G-quadruplex formation at the 
telomeric ends has been shown to inhibit the activity of the 
enzyme telomerase (9), which is essential for the prolifer- 
ation of most cancer cells (10,11). Insights from high- 
resolution G-quadruplex structures adopted by these se- 
quences could thus aid in drug design efforts targeting 
telomeric DNA for potential anticancer treatment (12,13). 

To date, at least five distinct intramolecular 
G-quadruplex folding topologies (Supplementary Figure 
SI) have been reported for natural human telomeric 
repeats (14-23), four of which were observed in the 
presence of K + ions. Form 1 (15-17) and Form 2 (18,19), 
observed in K + solution, both comprise the (3+1) G-tetrad 
core, but differ in procession of the different loops; Form 3 

(20) , also observed in K + solution, consists of a two- 
G-tetrad basket-type core with extensive base stacking 
interactions in the loops. In a K + -containing crystal 

(21) and under water-depleted conditions (22,24-26), 
the all-parallel-stranded propeller-type G-quadruplex 
structure was shown to be the preferred form. On the 
other hand, the only conformation to have been 
characterized in Na + solution consists of a three-layered 
basket-type G-quadruplex (23). Myriad studies on human 
telomeric G-quadruplexes have been carried out using a 
wide variety of techniques (27^11), ranging from ultravio- 
let (UV) (27), circular dichroism (CD) (28), fluorescence 
(29-31), Raman (32), electron paramagnetic resonance 
(33) and nuclear magnetic resonance (NMR) spectros- 
copy (34), to X-ray crystallography (21), mass spectrom- 
etry (35), gel electrophoresis (36), single-molecule Forster 
resonance energy transfer (37,38), laser-tweezers pulling 
(39), calorimetry (40) and molecular dynamics (MD) simu- 
lations (41), and the results have revealed the polymorphic 
nature of human telomeric repeats in the presence of K + 
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ions. While not fully addressed, this structural polymorph- 
ism has also been implicated in Na + solution (29,36,37), y e t 
the basket form has largely been used as the sole reference 
structure for interpretation of studies under Na + -contain- 
ing environment. Knowledge on the structural disparities, 
if any, in the presence of Na + ions would contribute toward 
fundamental understanding of the cationic influence on 
human telomeric repeats under different contexts. Using 
NMR, here we show that four-repeat human telomeric se- 
quences could adopt multiple G-quadruplex conform- 
ations in Na + solution. We present the solution structure 
of one of these forms, which exhibits features distinct from 
the known topologies. This latest addition to the ensemble 
of telomeric G-quadruplex structures establishes the 
polymorphic nature of human telomeric repeats beyond 
the influence of K + ions, and builds on the available pool 
of structural motifs for targeting of these important 
entities. 

METHODS 

Sample preparation 

Unlabeled (Supplementary Table SI) and site-specific 
labeled (2%- 15 N-enriched or 2 H-labeled; Supplementary 
Table S2) DNA oligonucleotides were chemically 
synthesized on an ABI 394 DNA/RNA synthesizer using 
products from Glen Research, Spectra Gases and 
Cambridge Isotope Laboratories. The oligonucleotides 
were de-protected following the manufacturer's protocols 
and purified using Poly-Pak™ cartridges. Samples were 
dialyzed successively against water, ~25mM NaCl solu- 
tion and water again. They were subsequently frozen, 
lyophilized and suspended in a buffer containing 20 mM 
sodium phosphate (pH 7.0) and 70 mM NaCl. DNA 
concentration is expressed in strand molarity using a 
nearest-neighbor approximation for the absorption coeffi- 
cients of the unfolded species (42). The same extinction 
coefficient was used for the natural and Bl G-substituted 
oligonucleotides. 

UV spectroscopy 

Thermal stability of different DNA oligonucleotides was 
characterized by recording the UV absorbance at 295 nm 
(43) as a function of temperature (20-90° C) using a 
JASCO V-650 spectrophotometer. The heating and 
cooling rates were 0.2°C/min. Two baselines correspond- 
ing to the completely folded (low temperature) and com- 
pletely unfolded (high temperature) states were manually 
drawn to determine the fractions of folded and unfolded 
species during the melting/folding transition. The melting 
temperature (T m ) is defined as the temperature for which 
there are equal fractions of folded and unfolded species. 
Experiments were performed with quartz cuvettes (1-cm 
pathlength for low DNA concentrations and 0.2-cm 
pathlength for high DNA concentrations). 

CD spectroscopy 

CD spectra were recorded at 20° C on a JASCO-815 
spectropolarimeter over the range of 220-320 nm using a 



1-cm pathlength quartz cuvette with a reaction volume of 
600 ul. For each sample, an average of three scans was 
taken, the spectrum of the buffer was subtracted and the 
data were zero-corrected at 320 nm. 

NMR spectroscopy 

Strand concentration of the NMR samples was typically 
0.5-2.0 mM. NMR experiments were performed on 
Bruker AVANCE 600- and 700-MHz spectrometers at 
25°C, unless otherwise specified. Resonances for guanine 
residues were assigned unambiguously using site-specific 
low-enrichment 15 N labeling (44), site-specific 2 H 
labeling (45) and through-bond correlations at natural 
abundance (46,47). Spectral assignments were assisted by 
NOESY, COSY, TOCSY and {'H- 13 C}-HSQC experi- 
ments. The spectra were processed with the software 
TopSpin and analyzed using the program FELIX (Felix 
NMR, Inc.). 

Structure calculation 

Inter-proton distances for the d[(TTAGGG) 3 TTA( Br G)G 
GTTA] (htel27[Br22]) G-quadruplex were deduced from 
NOESY experiments performed in H 2 0 (mixing time, 
200 ms) and D 2 0 (mixing times, 100 and 350 ms). 
Structures were calculated based on distance-restrained 
molecular dynamics refinement following distance 
geometry simulated annealing using the program 
XPLOR-NIH (48). Hydrogen bond restraints, inter- 
proton distance restraints, dihedral restraints, planarity 
restraints and repulsive restraints were imposed during 
structure calculations. Structures were displayed using 
the program PyMOL (49). Detailed procedures for struc- 
ture calculation are described in Supplementary Text. 

Data deposition 

The coordinates for the d[(TTAGGG) 3 TTA( Br G)GGTTA] 
(htel27[Br22J) G-quadruplex have been deposited in the 
Protein Data Bank (accession code 2MBJ). 

RESULTS AND DISCUSSION 

Four-repeat human telomeric sequences adopt diverse 
G-quadruplex conformations in Na + solution 

NMR spectra of four-repeat natural human telomeric se- 
quences with different flanking ends (Supplementary 
Table SI) in Na + solution indicated the adoption of 
multiple conformations (Supplementary Figure S2). In 
agreement with the previous study (23), the sequence 
d[AGGG(TTAGGG) 3 ] (denoted as htel22; Table 1) dis- 
played 12 major imino proton peaks at ~10.6-12.0ppm 
(Figure 1A), corresponding to the establishment of the 
basket-type G-quadruplex (Supplementary Figure S1A). 
In addition, two sequences with the same 5'-flanking end 
as htel22 showed a major species with a highly similar 
spread of imino proton peaks (Supplementary Figure 
S2F and G), suggesting their adoption of the same 
topology as the prevailing form. The other sequences 
mostly displayed the coexistence of two or more major 
species, judging from the number and intensity of imino 
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proton peaks. CD spectra of all 16 sequences displayed 
two positive maxima at ~250 and ~295nm, and a 
trough at ~260-270nm (Supplementary Figure S3). The 
295-nm peak is characteristic of opposite-polarity stacking 
of G-tetrads (50), suggesting that these sequences largely 
conform to antiparallel G-quadruplexes in Na + solution. 
Note that htel22 produced a sharp negative trough at 
~260nm (Figure IB), which was less intense in the other 
sequences. 

Formation of a G-quadruplex with an alternative 
conformation distinct from htel22 

We chose to probe the structures present in the 
sequence d[(TTAGGG) 4 TTA] (denoted as htel27; 
Table 1) (Figure 1A), which has the largest number of 
flanking nucleotides for potential interactions. 
Previously, 8-bromoguanine ( Bl G)-for-G substitution has 
been used to drive the folding of an oligonucleotide 
toward a single major G-quadruplex structure by 
favoring the syn glycosidic conformation at the 
substituted residue (17,18,20,51). Here, sing le Br G incorp- 
oration at position G22 of htel27, giving rise to the 
sequence d[(TTAGGG) 3 TTA( Br G)GGTTA] (denoted as 
htel27[Br22]; Table 1), led to the emergence of a predom- 
inant species (>85%) amenable for detailed structural 
characterization (Figure 1A). Htel27[Br22] showed 12 
major imino proton peaks at ~ 10.8 1 1.8 ppm, indicating 
the adoption of a three-layered G-quadruplex. The CD 
spectra of hte!27[Br22] and htel27 were highly similar 
(Figure IB), suggesting that htel27[Br22] could corres- 
pond to the preexisting major conformation(s) in htel27, 
or that the core topologies of htel27[Br22] and Iitel27 



Table 1. Representative natural and modified human telomeric 
sequences used in this study' 1 



Name 



Sequence 



htel22 
hteW 

htel27[Br22] 



A GGG TTA GGG TTA GGG TTA GGG 
TTA GGG TTA GGG TTA GGG TTA GGG TTA 
TTA GGG TTA GGG TTA GGG TTA Br GGG TTA 



"The modified 8-bromoguanine ( G) residue is in boldface 



were closely related. Their CD spectra were substantially 
different from that of htel22, notably the absence of the 
sharp negative trough at ~260nm, pointing to the 
adoption of an alternative folding topology. Melting 
analyses of the three sequences were carried out by moni- 
toring the U V absorbance at 295 nm (43), and the melting 
temperatures (r m ) of htel22, htel27 and htel27[Br22] were 
found to be 57.6, 44.5 and 50.8°C, respectively, in 
~100mM Na + (Supplementary Figure S4). The Br G- 
substituted oligonucleotide displayed a T m ~6°C higher 
than that of the natural counterpart, consistent with ob- 
servations from previous studies (20,51). 

NMR spectral assignments 

To proceed with the structural elucidation of 
htel27[Br22], guanine imino (Figure 2A) and adenine/ 
guanine H8 (Figure 2B) protons were unambiguously 
assigned using site-specific low-enrichment 15 N labeling 
(44) and site-specific 2 H labeling (45), respectively 
(Supplementary Table S2). The guanine imino and H8 
proton assignments were further corroborated by 
through-bond correlations at natural abundance (46) 
(Figure 2C). Assignments of thymine residues were sup- 
plemented by T-to-U substitutions (Supplementary Table 
S2). The H8/H6-H1' NOE sequential connectivity of 
htel27[Br22] (Supplementary Figure S5) was completed 
with the assistance of other through-bond correlation ex- 
periments (COSY, TOCSY and {'H-^Q-HSQC; data not 
shown) (47). The strong intensity of intraresidue H8-H1' 
NOE cross-peaks for G4, G5, G10, G16 and G23 
(Supplementary Figure S6) indicated that these guanine 
residues, together with Br G22, adopt the syn glycosidic 
conformation. All other guanine residues adopt the anti 
glycosidic conformation. 

Htel27[Br22] adopts a new antiparallel (2+2) folding 
topology in Na + solution 

Having determined the guanine imino and H8 proton as- 
signments for htel27[Br22], the alignment of the tetrads 
could then be deduced based on characteristic cyclic 
imino-H8 NOE connectivity patterns within each tetrad 
(Figure 3A and B). The top G-tetrad (G6.G24.G16-G10) 
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Figure 1. (A) NMR imino proton spectra and (B) CD spectra of human telomeric sequences: htel22, htel27 and hte!27[Br22] in ~100mM Na 
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Figure 2. NMR spectral assignments of htel27[Br22] in Na + solution. (A) Guanine imino protons were assigned based on l5 N-filtered spectra of 
samples, 2% 15 N-labeled at the indicated positions. The reference spectrum (ref.) with the respective guanine imino proton assignments is shown at 
the top. (B) Examples of adenine and guanine H8 proton assignments through site-specific 2 H labeling at the indicated positions. (C) Through-bond 
correlations between guanine imino and H8 protons via 13 C5 at natural abundance, using long-range /-couplings shown in the inset. Missing/weak 
correlations are framed in boxes. 



is oriented in the opposite hydrogen-bond directionality 
with respect to the other two G-tetrads, 
(G5.G11.G17.G23) and (G4.G12.G18. Br G22), and 
the glycosidic conformations of guanines are 
antiuantiusynusyn for all three tetrads. The central pos- 
itioning of (G5»G1 1»G17»G23) in the tetrad core agrees 
with solvent exchange data showing that the imino 
protons from this G-tetrad were the most protected 
(Supplementary Figure S7). Connecting the sequential 
guanine residues and joining the corners of the G-tetrad 
core with the linking TTA loops, we derived an antipar- 
allel G-quadruplex scaffold consisting of two adjacent 
strands pointing up and two remaining strands pointing 
down (up-up-down-down) or a (2+2) G-quadruplex 
scaffold (14) with a successive loop arrangement 
edgewise-propeller-edgewise (Figure 3C). 

Na + solution structure of the htel27/Br22J G-quadruplex 

The structure of htel27[Br22] G-quadruplex in Na + 
solution (Figure 4) was computed on the basis of NMR 
restraints (Table 2). There are one wide, one narrow and 
two medium grooves. The first (edgewise) loop crosses the 
top of the narrow groove, the second (propeller) loop tra- 
verses across one of the medium grooves, while the third 



(edgewise) loop completes a turn over the bottom of 
the wide groove. The 5'- and 3'-flanking TTA nucleotides 
take part in hydrogen-bond and stacking interactions at 
the bottom and the top of the structure, respectively. 
There is a Watson-Crick A9»T25 base pair capping the 
top of the tetrad core. A15 from the propeller loop also 
stacks over the top G-tetrad, potentially establishing inter- 
actions with the loop elements at the top. T14 from the 
same loop projects into the medium groove, whereas T13 
flips out of the propeller loop. At the bottom of the struc- 
ture, A3 and T20 establishes a Watson-Crick base 
pair, with A21 completing the formation of a triple 
platform. These stabilizing interactions of the loop and 
terminal residues would agree with the observation of add- 
itional broad imino proton peaks at ~12.2-14.0ppm 
(Supplementary Figure S8). The establishment of base 
pair stacking interactions across the ends of the tetrad 
core could have provided considerable contribution 
toward the adoption of the major conformation of 
htel27[Br22J, consistent with previous observations re- 
garding other human telomeric G-quadruplex structures 
formed in K + solution (15-20). In a longer sequence 
context, the various structural forms could exist in a 
state of equilibrium, with possible interconversion 
among them. 
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Figure 3. Determination of the folding topology of htel27 [ Br22 ] . (A) 
NOESY spectrum (mixing time, 200 ms) showing the imino-H8 con- 
nectivity of htel27[ Br22] . Cross-peaks that identify the arrangement of 
the G-tetrads are framed and labeled with the residue number of imino 
protons in the first position and that of H8 protons in the second 
position. (B) Characteristic guanine imino-H8 NOE connectivity 
patterns around a G a »Gp»G 7 »G5 tetrad as indicated with arrows (con- 
nectivity between G5 and G„ implied). (C) Schematic structure of 
htel27[Br22]. Ami guanines are colored in cyan, while syn guanines 
and 8-bromoguanine are colored in magenta. The backbones of the 
core and loops are colored in black and red, respectively. W, M and 
N represent wide, medium and narrow grooves, respectively. 



Novel features of the htel27 / Bv22 ] G-quadruplex 

We have shown that in Na + solution, htel27[Br22] 
assumes a novel antiparallel (2+2) folding topology 
(Figure 3C). Although the tetrad cores of htel27[Br22] 
(this work) and htel22 (23) possess the same relative 
strand orientations, they differ in the hydrogen-bond 
directionalities of the tetrads: anticlockwise-clockwise- 
clockwise for hteI27[Br22] versus clockwise- 
anticlockwise-clockwise for htel22 (Supplementary 
Figure S9). Loop arrangement of htel27[Br22J 
(edgewise-propeller-edgewise) is also different from that 
of htel22 (edgewise-diagonal-edgewise). To date, antipar- 
allel (2+2) G-quadruplexes that have been investigated 
largely consist of edgewise and/or diagonal loops 
(20,23,52-54), while quadruplexes containing propeller 
loops mostly belong to the all-parallel-stranded (21,22) 
or (3+1) (15-19,55-57) core topology. Antiparallel (2+2) 
G-quadruplexes with a propeller loop (58), such as that 
formed by htel27[Br22], have not been observed in the 
context of human telomeric repeats. 



CONCLUSION 

The antiparallel (2+2) G-quadruplex scaffold adopted by 
htel27[Br22] (and presumably a subpopulation of htel27), 
as well as the observation of the coexistence of multiple 




Figure 4. Stereoviews of the htel27[Br22] G-quadruplex structure in 
Na + solution. (A) Ten superimposed refined structures. (B) Ribbon 
view of a representative structure. Ami guanines are colored in cyan; 
syn guanines and 8-bromoguanine, magenta; adenines, green; thymines, 
orange; backbone and sugars, gray; 04' atoms, red; phosphorus atoms, 
yellow. 



Table 2. Statistics of the computed structures of the 27-nt 
Br G-modified human telomeric sequence 
d[(TTAGGG) 3 TTA( Br G)GGTTA] (htel27[Br22 ]f 

A. NMR restraints 

Distance restraints D 2 0 H 2 0 

Intraresidue distance restraints 446 0 

Sequential (/', /' + 1) distance restraints 197 26 

Long-range (;, >/' + 2) distance restraints 79 32 

Other restraints 

Hydrogen bond restraints 48 

Dihedral restraints 12 

Planarity restraints 3 

Repulsive restraints' 5 2 

B. Structure statistics for 10 molecules following distance-restrained 
MD refinement 

NOE violations 

Number (>0.2A) 0.800 ± 0.872 

Maximum violation (A) 0.195 ± 0.050 

RMSD of violations (A) 0.022 ± 0.002 

Deviations from the ideal covalent geometry 

Bond lengths (A) 0.003 ± 0.000 

Bond angles (°) 0.694 ± 0.007 

Impropers (°) = 0.367 ± 0.004 

Pairwise all heavy atom RMSD values (A) 

All heavy atoms except Tl, T2 1.04 ± 0.12 

All heavy atoms 1.49 ± 0.35 

a PDB ID: 2MBJ. 

b Distance restraints between pairs of protons that do not exhibit NOE 
cross-peaks. 



conformations in the series of four-repeat natural human 
telomeric sequences in Na + solution, point to additional 
diversity than had been previously assumed. Even though 
intracellular Na + content is ~10 times lower than that of 
K + , these structures observed in Na + solution could po- 
tentially take part in the interconversion between various 
quadruplex forms. 
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